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ABSTRACT 

Recent studies, using OGLE data for LMC Cepheids in the optical, strongly 
suggest that the period-luminosity (PL) relation for the Large Magellanic Cloud 
(LMC) Cepheids shows a break or non-linearity at a period of 10 days. In this 
paper we apply statistical tests, the chi-square test and the F-test, to the Cepheid 
data from the MACHO project to test for a non-linearity of the V- and i?-band 
PL relations at 10 days, and extend these tests to the near infrared {JHK-h&nd) 
PL relations with 2MASS data. We correct the extinction for these data by 
applying an extinction map towards the LMC. The statistical test we use, the 
i^-test, is able to take account of small numbers of data points and the nature of 
that data on either side of the period cut at 10 days. With our data, the results 
we obtained imply that the VRJH-ha,nd PL relations arc non-linear around a 
period of 10 days, while the K-h&nd PL relation is (marginally) consistent with 
a single-line regression. The choice of a period of 10 days, around which this 
non-linearity occurs, is consistent with the results obtained when this "break" 
period is estimated from the data. 

We show that robust parametric (including least squares, least absolute devi- 
ation, robust regression) and non-paramctric regression methods, which restrict 
the influence of outliers, produce similar results. Long period Cepheids are sup- 
plemented from the literature to increase our sample size. The photometry of 
these long period Cepheids is compared with our data and no trend with period 
is found. Our main results remain unchanged when we supplement our dataset 
with these long period Cepheids. By examining our data at maximum light, we 
also suggest arguments why errors in reddening are unlikely to be responsible for 
our results. The non-linearity of the mean F-band PL relation as seen in both 
of the OGLE and MACHO data, using different extinction maps, suggests that 

Stars: fundamental parameters 



used to determine Hubble constant. The current most 
widely used PL relation is based o n the Large Magellanic 
Cloud (LMC) C epheids (e.g., see lFreedman et alJlioOll : 
ISaha et al.l200l)) . The Cepheid PL relation has long been 
considered to be a linear function of log(P) within the 
range of log(P) ~ 0.3 to log(P) ~ 2.0, where P is the 



this non-linearity is real. 
Key words: Cepheids - 

1 INTRODUCTION 

The Cepheid period-luminosity (PL) relation plays a cru- 
cial role in the distance ladder which can ultimately be 
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pulsation period in days. Current versions of the LMC 
PL re l ations can be f ound, e.g.. in [M^^r^^^Vgg^^K 



'Ij re l a,tions can be r ound, e.^., mlM ^qore ^ i<r^^(;im^r 
199lVTanvir' ("19 9711 .iGieren et al.l(IT998ll . lUdalski et al. 



y_g99a) and Pcrs son et al 

However, recent stud i es llTammann fc ReindJ 
120021 : iTammann et alJ l2002l : iKanbur fc Ngeowl l20oJ ^ 



Isandage et alJl2004l stronglv suggest that the LMC PL 
relation is not linear, as there are two PL relations for 
the short (P < 10 days) and the long (P > 10 days) 
period LMC Cepheids, respectively. These studies are 
mainly based on the optical data (SV7-band) from 
the Optical Gravitat ional Lensing Experiment (OGLE, 
lUdalski et al] liaiQaV) database for the fundamental 
mode Cepheids. The non-linear V- and /-band LMC 
PL relations are further supported by the results from 
a rigorous statistical test, the P-test, as described in 
[Kanbur & Ngcow ( 20o4 hereafter KN). This P-test is 
sensitive to the number of data points and the nature 
of the data on either side of 10 days. This sensitivity is 
such that if, either the number of data points is small or 
the data has a large scatter, the slopes on either side of 
10 days are not well defined and the test returns a non- 
significant result. We describe this property of the P-test 
in detail in Section 3. Hence the work of KN, which used 
the OGLE data alone and found marked non-linearities 
in the PL (and PC, period-colour) relation s in the LMC 
at a period of 10 days is in contrast to ICieren et al.l 
l)200fJl . who state that the OGLE data alone do not 
show any non-linear behavior or break at a period of 
10 days. Since th e OGLE Cepheid dat a are truncated 
at log (P) ~ 1.5 jUdalski et al.lh999ah. ISandage et al.l 
l|2004) used additional PV7-band data that are available 
from the literature, especially those with log(P) > 1.5, 
to further support the existence of two PL relations in 
the LMC Cepheids. Compared to previous studies, the 
large number of LMC Cepheids with high quality photo- 
metric data, together with the estimation of extinction 
{E[B ~ V]) to individual Cepheids, given in the OGLE 
database has made the detection of non-linear LMC PL 
relation becomes possible. 

The non-linearity of Cepheid PL relations has been 
recognized for some time in the literature, as the 
mean magnitudes for some of the long period Cepheids 
(log[P] > 1.5) do not follow the best-fit lines. In fact, 
some of the earlier studies (including the works of 
Kukarkin, Shapley & Fernie) on the Cepheid PL rela- 
tion have suggested or used a quadratic form, M = 
a -\- /91og(P) -I- 7[log(P)]^, to describe the PL relation 
l)Fernielll96?t ). For example, a semi-empirical derivation 
of the quadratic PL re l ations in the B- and F-band 
can be found in iFernij il967fl . In addition, the com- 
posite PL relations at mean and maximum light con- 
structed with Cepheids in different galaxies, as pre- 
sented in ISandage fc TammannI lll968h . also show some 
curvature of the PL r elatio n s. In terms of theor etical 
modeling. iBono et al.l lll999h . ICaputo et al.l (|200Cf) and 
iMarconi et alj i2005tr have fit quadratic PL relations to 
the theoretical periods and magnitudes obtained from the 
pulsational modeling. The non-linearity of the PL rela- 
tions, however, can also be explained with the broken PL 
relations: there are two PL relations, for short and long 
period Cepheids respectively, with a break at a specific 



period (e.g., at 10 days). Perhaps the earliest idea for the 
broken PL rela tion was prop osed by Kukarkin in 1937, 
as quoted from iFernij lll969h : "...he concluded that the 
relation was definitely non- linear... but the best fit was by 
two straight lines of different slope intersecting at a pe- 
riod of 10 days". The prese nt-day version of the broken 
PL relat ions is presented in [ Tammann fc ReindJ (|20o3), 
KN and ISandage et"ai] \2Q0^ with the OGLE data, as 
mentioned in the previous paragraph. These broken PL 
relation are also compared w ith the theoretical predic- 
tions in |m arconi et al.l i2005l) , which show a quite satis- 
factory agreement. Note that ICaputo et a l. ( 1999) have 
fit a broken PL relation to their results obtained from the 
theoretical models, but with a break at log(P) = 1.4. 



Besides the non-linear PL relations seen in the LMC 
Cepheids, the SMC fundamental mode Cepheids also 
show a change of slope for Cepheid s with periods les s 
than 2 days from the EROS-2 data llBauer et al.lll999^ . 
The authors carefully examine the possible errors or bias 
in the data that lead to this finding, and conclude that 
the non- linear PL re lations seen in SMC Cepheids is real. 
iBaraffe et al.l il998l) explained that this change of the 
slope for short period (less than 2 days) SMC Cepheids is 
due to evolutionary effects: this is because the blue loops 
for these short period SMC Cepheids do not fully extend 
into the instability strip. However, the non-linearity of 
the LMC PL relation at ~ 10 days period is believed to 
be independent of this evolutionary effect. In this paper, 
we concentrate on studying the 10 days non-linearity for 
the LMC PL relation and do n ot suggest tha t this i s the 
same phenomena described bv lBaraffe et alJ il998l) but 
for the higher metallicity LMC Cepheids. 



The motivation of this paper is to extend the work 
of KN into the study of the linear and/or non-linear na- 
ture of the PL relations at /around 10 days for the fun- 
damental mode LMC Cepheids. It it important to ver- 
ify the non-linearit y of the LMC PL relation a t 10 days 
from the studies of [Tammann fc Reindll i2002h . KN and 
ISandage et all hOQ^ . which mainly analyzed the OGLE 
data, with an independent dataset. Hence, we examine 
the non-linearity of the optical LMC PL relation with 
the Cepheid data o btained from the MACHO database 
jAlcock et alj|2000h in this paper. In addition, we also 
investigate the LMC PL relations in the near infrared 
(NIR) bands (i.e., Ji/ZC-band) to see if the NIK PL rela- 
tions also show a non-linearity. We apply statistical tests 
to investigate the possible non-linearity of the LMC PL 
relation in these bands. We formulate this non-linearity 
in three ways and examine if the data are more consistent 
with a linear or non-linear PL relation. 



In this paper, we only use the published mean mag- 
nitudes (and periods) for the LMC Cepheids from the 
literature. The data used in this paper is described in 
Section 2. The statistical analysis and the results are pre- 
sented in Section 3 and 4, respectively. The conclusion fc 
discussion will be followed in Section 5. 
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Figure 1. The extinction corrected V- and J-band PL relations with all 1330 Cepheids in the sample. The filled circles are the 8 
outliers as described in the text. The error bars for the outliers are the quadrature sum of the photometric errors and the extinction 
errors. The error bars for other Cepheids are omitted for clarity. 



2 DATA AND EXTINCTION 
CORRECTIONS 

The data for the 1330 fundamental mode LMC Cepheids 
are adopted from Nikolacv ot al. (2004). This includes 
the V- and R-hanA mean magnitudes, as well as the pe- 
riods, from the MACHO database and the J-, H- and 
if-band mean magnitudes f rom the Two Mi cron All Sky 
Survey database (2MASS. ISkrutskid 11998*1 . The mean 
magnitudes in the V- and _R-bands are computed from 
the complete light curves with the MACHO photomet- 
ric data. The (random phase) magnitudes in J-, H- and 
if-bands are from the 2MASS single epoch observations, 
hence the mean magnitudes in these NIR bands are ob- 
tained with a random-phase correction function. The de- 
tails for obtaini ng the mean magnitud es in all these bands 
can be found in lNikolaev et all i2004l) . To check the accu- 
racy of these JHK mean magnitudes, one of us (Nikolaev 
et al, 2005 - in preparation) has compared the JHK mean 
magnitudes from t he 2MASS data (afte r random-phase 
correction) with the lPersson et all ll2004) data, which are 
obtained from the well-sampled {JHK) hght curves. For 
the 33 matched Cepheids in t he MACHO/2MASS sample 
and the lPersson et all (|20o3) sample, the average differ- 
ence in the mean magnitudes are: Aj = —0.022 (with std 
= 0.070, where std = standard deviation), Ajj = —0.012 
(std = 0.073) and A^ = -0.015 (std = 0.068). These are 
completely consistent with the zero difference. More im- 
portantly, the plots of the difference between the means 
in these bands as a function of period (not shown) do not 
show any trends with period, hence the JHK mean mag- 
nitudes from the 2MASS database will not seriously affect 
our analysis and result s. We did not use other datasets 
in VRJHK-hanAs, as in lSandaee et al.1 (|^0Q4'), to perform 
the main analysis in order to keep our dataset as consis- 



tent and homogeneous as possible (however, see Section 
5.2). 

The next step in the analysis is to obtain 
the extinction-corrected mean magnitudes for these 
Cepheids. This was done by applying an extinction 
map with the following values of R\ (ratio of total-to- 
selective reddening , and R\ = AxlE\B — V]), in order to 
be consistent with iNikolaev et alJ ll2004f) : Rv,r,j,h,k = 
{3.12, 2.56, 0.90, 0.53, 0.34|. The LMC extin ction map 
we used is taken from IZaritskv et al.l (|200^. In brief, 
given the coordinates for each Cepheid and a search ra- 
dius r, the extinction map returns the average value of 
extinction, Ay ± ctav (where aAv = std/y/N), from the 
stars bounded within the search radius. The extinction 
map allows us to choose the Av obtained from cool stars 
(5500K < Te// < 6500K), hot stars (12,000K < Te// < 
45, OOOK) or both. Since Cepheid variables are cool stars, 
we only use the extinction values from the cool stars. 
We use r = 2' to ensure that there are sufficient num- 
bers of stars within the search radius to obtain the Ay 
(usually more than 10 stars) ^. However, there are two 
Cepheids with locations beyond the extinction map and 
one Cepheid without any cool stars within the r = 3' 
radius. For simplicity, we assume Ay — 0.3 ± 0.1 mag 
(roughly the mean extinction of LMC) for these three 
Cepheids. After the values of Av for the Cepheids are 
obtained, the extinctions in other bands are calculated 
with the following relation: A\ = {■^)Av with the er- 
rors a Ax = i'^)'^Av, where A denotes the passband. 

We consider the following three samples in this 
study: 



Only two Cepheids with enclosed stars that are less than 10 
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Sample A - This sample consist of all 1330 Cepheids in 
our dataset. 

Sample B - We then remove 8 outliers out of 1330 
Cepheids (all of these outliers have log[P] < 0.75) from 
Sample A, as they are fainter by more than ~ 1.2mag. 
and ~ l.Omag. in V- and 7?-band, respectively, to the 
fitted PL relations at the same period. The locations of 
these outliers in the PL relations are plotted in Figure 
It is possible that the extinctions for these outliers are 
underestimated because they are not obvious in the NIR 
PL relations (see the right panel of Figure . Another 
possibility is that they are Type II Cepheids that are 
mis-identified as the classical Type I Cepheids. However, 
it is clear that these outliers should be removed from our 
sample. Hence Sample B consists of 13 2 2 Ceph eids. 

Sample C - Following lUdalski et al ] |l9993)> 

we also 

remove the Cepheids with log(P) < 0.4 from Sample 
B. This is to guard against t he possible con tamination 
from first overtone Cepheids dUdalski et al ] |l999a) . as 
well as other types of variables such as t he anomalous 
Cepheids and the double-mode Cepheids jAlcock et al.l 
Ll999). In addition, this choice of the period cut will also 
guard against the possible change of slopes for short 
(P < 2 days) period Cepheid s, as seen in the SMC 
Cep heids llBauer et al.ll l999: Ud alski et ai]|l999a^ . Note 
that lUdalski et al .1 (Il999a) has applied the period cut of 
log(P) = 0.4 to the OGLE LMC Cepheids to derive the 
PL relations tha t were used in other st udies, such as the 
Hq Key project jFreedman et al.ll200 j) . Hence the same 
period cut was also applied in previous studies of the 
non-linear LMC PL relations ("Tammann fc Re indlll2002l: 
[Kanb ur fc Ngcow 2004; gandagc ct al. 2004). Further- 
more, the period distribution of the LMC Cepheid s shows 
a sharp break at log(P) ~ 0.4 jAlcock et al.ll999^. This is 
probably due to ev olutionary effects jBaraffe et ai]ll998l: 
lAlcock et al] ll999l). We contend that these reasons jus- 
tify the removal of Cepheids with log(P) < 0.4, and leaves 
1216 Cepheids in this sample. Our use of the P-test, as 
given in the next section, ensures that such a period cut 
will not affect our results. 



3 ANALYSIS 

In order to study the linear and/or non-linear nature of 
the LMC PL relation, the data presented in the previ- 
ous section are fitted with the (weight ed) least squares 
regression (see, e.g.. IPress et al.lll992h . Ordinary least 
squares regression, OLS{Y\X), has been extensively used 
in the literature to obtain the fitted PL relations from ob- 
servations (for examp l es. see | M adore fc Frccdmanlliggil: 
Laney fc Stobielll994t Icieren et al.lll99& .Udalski et all 



1999al: iKanbur fc Ngeowl 120041: ISandage et alJ 12004 
^reednm^^dTl20ML and many other papers on PL re- 
lation using the Galactic Cepheids) as we l l as from the- 
oretical modelings (as in|Bono et al]ll999l : ICaputo et al.l 
I2OOOI: iBaraffe fc Alibertll200ll ). Thi s is because th e re is 
a good physical theory (see, e .g., ISandagel Il958t ICoxl 
ll980l:lMadore fc Freedmanlll 99l') suggesting why the pe- 
riod is the independent variable and is measured essen- 
tially without errors, hence the OLS{Y\X) regression can 
be applied. In terms of observations, the errors of the 



periods for the MACHO Cephei ds used in this pape r 
(see Section 2) is about 10"''P jNikolaev et alJl2004^ ■ 
which is comparable to the accuracy reported by the 
OGLE team {gp ~ lO^^P, Udalski ct al. 1999^, as well 
as other high quali ty photometric observations (e.g., see 
iMoffett et"ai]ll998l. and some exa mples f rom the Ga lac- 
tic Cepheids i n ICoulson et aLlliJs S. Shob brooklll992l and 
iBersier et aI]ll994^ . The OLSiY\X) regres sion is also 
recommended for the distanc e scale studies ilsobe et alJ 
ll990l:lFeigelson fc Babulll992^ . 



The weights (w) that are used in the regressions are 
the quadrature sum of the photometric errors in mean 
magnitudes (cta), the errors from the random phase cor- 
rections that are only applicable to JHK-hcinAs (o"rp), 
the errors from extinction (o■A;^), and the intrinsic dis- 
persion due to the finite width of the instability strip 
(a/s). Hence, w{i) = al{i) + (jlp{i) + cr^^(i) -|- a'js- 
The dispersions of the PL relations in each band from 
iMadore fc Freedm an' ("19911) can be initially adopted for 
(Jis. However, a co mparison of th e dispersions g iven i n 
Madore fc Freedmani lil99il) with ICieren et al.l Jl998 " 



Perasor^^dT ll2004l) and ISandage et al.1 ll20oll suggest 



t hat the F-R-band and the J HK-hwiA dispersions given 
in lMadore fc Freedmani Jl99J) should be scaled by ~ 0.73 
and ~ 0.88, respectively. Hence we adopt the following 
values for the intrinsic dispersions: ais{V, R, J, H, K) = 
{0.21, 0.18, 0.14, 0.12, O. llj. The adopted g/s suc h 
as those originally given in lMadore fc Freedmani il99 J) , 
does not affect the results from the following statistical 
tests. We also assume that all LMC Cepheids are at the 
same distance so that there is no distance variation in 
the weight formula. 

One way to check for a non-linearity on the LMC PL 
relation is through a residual plot. If the one-line regres- 
sion is adequate, then the residuals should be populated 
(roughly) evenly around the zeroth residual line (i.e., in- 
dependent of period) . On the other hand, if the residuals 
show a trend, or some trends, in the residual plot, then 
more parameters are needed to fit the data. The resid- 
uals of the MACHO T/-band data from previous section 
are plotted as function of log(P) in Figure (21 assuming 
that the one-line regression is adequate. A linear fit of 
the residuals with log(P) > 1.0 gives non-zero slopes of 
-0.66±0.27, -0.65±0.27 and -0.67±0.27 from Sample 
A, B and C, respectively. These slopes clearly indicate 
that the trends seen in Figure |5| are significant. Hence a 
one-line regression may not be adequate to fit the MA- 
CHO F-band PL relations. Similar residual plots in other 
bands are presented in the right panels of Figure |1] 

To investigate the possible non-linearity of the PL 
relations, we applied the chi -square test and the P-test 
(as given in IWeisberdligiol. and used initially in KN) 
using the weighted least squares results. The null (Ho) 
and the alternate (Ha) hypothesis represent the reduced 
and the full models, respectively, where the reduced mod- 
els are obtained by setting some parameters i n the full 
mode ls to certain specific values (e.g., zero, see lWeisberd 
I198OI) . Hence our reduced model is a one-line regression 
(2-parameter) fit to the entire period range, and the null 
hypothesis. Ho, is that one-line regression is adequate to 
describe the data. In contrast, Ha is that the full model is 
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Sample A 



Figure 2. The residuals for the K-band PL relation as a func- 
tion of log-period from Sample A. The residuals are obtained 
assuming the one-line regression is adequate. The solid lines 
are the linear fits of the residuals as function of log-period, 
separated at 10 days. The dashed line is the zeroth residual 
line. The residual plots from Sample B &; C are similar to this 
plot. 



necessary to fit the data. Here, we consider tiiree different 
kinds of full models: 

(i) A two-line regression (4-parameter) witii a break 
at a cliaracteristic period, Po, i.e., 



m = a^Iog(P) -|-6^,P > Po, 
m = a-^Iog(P) < Po. 



(1) 



(ii) A piecewise continuous regression (3-parameter) 
tliat forces the two-line regression to be continuous at 
Po, i.e., 

m = c + d X min[log(P), log(Po)] 

+e X max[0, log(P) - log(Po)]. (2) 

(iii) A quadratic regression (3-parameter) of the form 

of; 



= Q + /3Iog(P)+7[log(P)] = 



(3) 



Although we consider these three models, our main in- 
terest is in the investigation of a change of slope in the 
LMC PL relation for short and long period Cepheids. It 
is clear that two lines of differing slope, continuous at Po 
and even perhaps with a small discontinuity at Po, are ap- 
proximately a quadratic with a local minimum / maximum 
derivative at Po. 

To apply the chi-square test and the P-test, we then 
calculated the weighted error sum of squares, SSE = 
X]['T^(i) — 'rn.{iy^ /'w{i), for both Ho and Ha, where m(i) 
and rh{i) are the observed magnitudes and the fitted mag- 
nitudes from weighted least squares for ith Cepheid, re- 
spectively. Under Ho, the difference of the SSE: 



Table 1. The critical values for the chi-square test and the 
P-tcst. 





p = 0.05 p 


= 0.01 




values 




1 


3.84 


6.63 


2 


5.99 


9.21 




F values 




1 


3.84 


6.63 


2 


3.00 


4.61 



u = 1 for the alternate hypothesis given in equation (2) & 
(3); u = 2 for the alternate hypothesis given in equation (1). 



X 



SSE{Ho) - SSE{Ha 



(4) 



follows approximately a chi-square distr ibution with v = 
npL — riRD degrees of freedom (e.g, see IWeisberdligiol) . 
where npL & nao are the number of parameters in full 
and reduced models, respectively. For a large number of 
data points {N), this is essentially the same as apply- 
ing a standard P-test (which many regression packages 
produce) under weighted least squares, where 



P : 



SSE{Ha)/{N -ufl) 



(5) 



distribution un- 



which is approximately a Fi^_ 

der the null hypothesis (see also lKanbur fc Ngeowll2004 ) . 
where is given in equation (4). The corresponding 
probability (p) for the and the P- values, under Ho, 
can be obtained from consulting the standard statisti- 
cal books or softwares. A large value of and/or P 
(hence small p) indicates that the null hypothesis can be 
rejected, and therefore the data is more consistent with 
the alternate hypothesis. Tabled summarizes the x'^ and 
P-values that correspond to the case oi p = 0.05 and 
p = 0.01. In case of the P-test, p(P) < 10"^ for P > 7 
with large number of data points (say, A'^ > 500). 

The P-test relies on the assumptions that the resid- 
uals are normally distributed and are homoskedastic. KN 
tested the OGLE data for these assumptions. With such 
large numbers of data departures from normality are 
unlikely to significantly influence our results. The chi- 
square test, as formulated above, does not depend on ho- 
moskedasticity and hence we use it in this study to sup- 
plement our P-test results. Perhaps a simpler chi-square 
test is the following: compare the chi-square statistic for 
a one- and two-line regression respectively. However, this 
approach does not have the necessary power to distin- 
guish between the two cases and hence we use the chi- 
square test given in equation (4) above. 

Furthermore, the P-test we use, as in equation (5), 
to test for changes in slope on either side of a period 
cut, is sensitive to the number and nature of Cepheid 
data (such as the distribution of the data) defining the 
slope on either side of the period cut. Equation (5) con- 
tains an explicit dependence on these factors. Since the 
variance of the fitted slope, a, is Var(a) = 
where n is number of parameters and Sxx = 
(a; is the mean value of j: = log[P]), hence another 



(N-n) Sxx ' 
X? 
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Table 2. Testing the significance of the F-test with various period cuts. 



OGLE MACHO 
log(Pi) log{P2) AfiogP<i AfiogP>i F p{F) iViogP<i AfiogF>i F p{F) 



0.4 


1.5 


585 


49 


8.867 


0.000 


1147 


69 


12.32 


0.000 


0.6 


1.4 


268 


45 


8.799 


0.000 


494 


63 


10.62 


0.000 


0.7 


1.3 


136 


41 


6.904 


0.001 


249 


58 


5.438 


0.005 


0.8 


1.2 


72 


30 


0.279 


0.757 


141 


51 


1.124 


0.327 


0.9 


1.1 


25 


15 


0.909 


0.412 


56 


29 


0.029 


0.972 


0.9 


1.5 


25 


49 


1.000 


0.373 


56 


69 


0.225 


0.799 



way of writing SSE as used in equation (5) is SSE = 
Ya,r{a){N — n)Sxx ■ Thus the _F-test automatically builds 
into it the amount of precision in both slopes, which in 
turn is a function of the data used and the amount of vari- 
ability about the regression line. Alternatively, one may 
think of the _F-test as F = t'^ , where t is equal to the 
difference in the two slopes divided by the standard error 
of the difference. Thus if we have another dataset with 
the same number of data points as ours but, for example, 
a larger (or smaller) scatter, then for this other dataset 
the slopes are less (or better) well defined with larger (or 
smaller) variance and the F-test statistic would generally 
decreases (or increases). In another situation, if we add 
(or remove) some data from our sample (assume the old 
and new samples have the same distribution) , then gener- 
ally the slopes will be more (or less) accurately estimated 
and the _F-test statistic would increases (or decreases). 
We say "generally" because the data that is added or 
removed could have a large or small scatter which can, 
in some cases, offset the change in the amount of data. 
Thus the _F-test statistic is sensitive to both the number 
of data points and the nature of the data. 

We have verified this by using the LMC OGLE data, 
as used in KN, and formulating a F-test for a difference 
in the PL slope between short (P < 10 days) and long 
(P > 10 days) period Cepheids. However we also make a 
cut at periods Pi and P2 such that Pi < 10 < P2, thus 
decreasing the number of Cepheids available to define the 
slope on either side of 10 days. The results are presented 
in Table H for the OGLE F-band data used in KN, as 
well as the MACIfO F-band data as described in Sec- 
tion 2 (the Sample C). This table shows that the P- value 
generally decrease, or the significance of the results gen- 
erally goes down (with p- values increase) , as the number 
of Cepheids on either side of the break point goes down^ . 
It also depends, to some extent, on the nature of the data 
that was removed or added. We see that when we have 
less than about 30 Cepheids on the long period side, the 
significance is reduced and the P-test would indicate that 
the data are consistent with a single line. Similarly when 
we have less than about 25 Cepheids on the short pe- 
riod side. With such few Cepheids the slopes are not well 
defined with large errors. This gives us confidence that 
when we do get a significant result, that result has some 
meaning. 



3.1 Choosing Po 

In th e recent studies of the non-linear LMC PL rela- 
tions jTammann fc Reindill2002l: iKanbur fc Ngeowll2004l: 
ISandaee et alJl200% the character i stic p eriod Po is cho- 
sen to be 10 days. ISandage et al.l ||2004) has discussed 
the significance of the discontinuity at 10 days, which 
will not be repeated here. However, the value of Po can 
be found from equation (2) by making log(Po) another 
parameter in the least squares fit. To find Po, we ap- 
plied non-linear estimation procedures, as given in the 
SAS package, to the V-band data from Sample C. The 
result is log(Po) = 0.934, with the upper and lower 95% 
confidence limits of 1.089 and 0.778, respectively'^ . This 
res ult is fully consistent with the choice o f Po = 10 days 
in iTammann fc ReindJ tOO'j) . KN and ISandaee etaJl 
II2004 '1. Other evidence to support the choice of Po = 10 
days include: 

(i) The amplitudes and the Fourier parameters from 
the light curves of the short period (P < 10 days) 
Cepheids are radically different from t heir long pe- 
riod (P > 10 days) co u nterpa rts fsee ISimon fc Led 
19811: ISimon fc Moffetj Il985l: l Andreasenl Il988l: 
Udalski et alJ Il999a|: iKanbur et alJ l2002l : iNgeow et alJ 
20031 : ISandage et alJl20ol ' and references therein), where 
the plots of the amplitudes and Fourier parameters 
as a function of log(P) show an abrupt change at 10 
days. This is a well established fact in stellar pulsation. 
There is a well-known explanation for this change in 
the Fourier parameter s at 10 days : the Hertzsprung 
progre ssion (see, e.g. ISimon fc Led Il98ll : ^ono ct alj 
l2000bl . and references therein). We do not imply that 
this theoretical explanation for the change in Fourier 
parameters at a period of 10 days is responsible for 
possible changes in the Cepheid PL relations at the same 
period, merely that there is a precedent in the Cepheid 
literature for "something happening" at a period of 10 
days. 

(ii) In a separate paper (Ngeow & Kanbur, 2005 - in 
preparation), we study the multi-phase PC and PL rela- 
tions for the OGLE LMC Cepheids. This is because the 
PL and PC relations at mean light (as, e.g., presented 
in iTammann fc Reindll 120021: IKanbur fc Ngeowl l2004 
ISandage^^^|]2004l) are the average of all o ther phases 
jKanbur fc N geow"2004l: IKanbur et a l."2004\ From this 
study, we found that both of the LMC PL and PC 



^ Note that the peak of the period distribution for both sam- ^ The result using Sample A is: log(Po) = 0.981, with the 95% 

pies is located around log(P) ~ 0.5, see Section 5.2. confidence limits of 1.148 and 0.815. 
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relations provide compelling evidence of a strong non- 
linearity at period of 10 days at a pulsation phase of 0.82 
(where phase zero corresponds to maximum light). A pre- 
liminary result from this study is presented in Figure |31 
and the details will be given elsewhere. 

(iii) The wo r k oflTammann fc Reindll ll2002ll . KN and 
ISandaee et al.l i2004l) strongly suggests that the LMC 
(optical) PC relation in {B — V) and/or {V — I) colour 
is also non-linear with a br eak at 10 days. This can b e 
clearly see from figure 2 of lTammann fc ReindJ ll2002l) . 
figure 2 of KN (which also include the PC relations at 
maxim um and minimum light), figure 1 of lSandage et all 
ll2004h and also the left panels of Figure]^ in this paper. 

(iv) The behavior of the amplitude-colour (AC) rela- 
tions for the LMC Cepheids are difi'erent for the short 
(P < 10 days) and long period Cepheids at maximum, 
mean and minimum light (KN). This is because the PC 
and AC relations are connected as described in KN. 

(v) The residuals plots using one-line regression shows 
a trend at long period range (log[P] > 1.0) for the V- 
and -R-band PL relations (Figure|5|& right panels of Fig- 
ure m , as well as the PC relations (see figure 4 from 

i vi) The empir i cal LM C instability strip, as presented 
Sandage et alJ ll2004ll . clearly show a break at a pe- 
riod of 10 days. The properties of the LMC instabil- 
ity strip, such as the ridge lines of the strip and the 
slope of constant-period lines, also different for the long 
(log[P] > 1.0) and short period Cepheids. 

Hence, we follow the existing l iterature llTamin ann et all 
12001 iKanbur fc Ngeowl l2004l: ISandage et alj 12004 and 

adopt Po — 10 days in this paper. Note that due to the 
intrinsic dispersion of the PL relation (because of the fi- 
nite width of the instability strip), the exact location of 
the Po is difficult to accurately determine with statistical 
means from the data. We emphasize that the confirma- 
tion of Po ~ 10 days has to be done with the stellar 
pulsation modeling, which is beyond the scope of this 
paper. 



4 RESULTS 

In Table 13 - IHl we present the fitted results for one- 
line, two-lines, piecewise continuous and quadratic re- 
gressions, respectively, for each of the three samples 
considered in this paper. It is worthwhile to point 
out that even though the dispersion values (given by 
y'J2[^(i)~''T^{'i')]'^/iN — n), which is not the same as 
the SSE that enters the chi-square test and the P-test) 
given in the last column of these tables are comparable to 
each other'', though with a slight improvement in Table 
El&IHla'S compared to Table El This, however, does not 
mean that the four regression models describe the data 
equally well. The SSE is approximately the dispersion 
multiplied by the number of data points so differences 
between the models are more apparent. Further, a sta- 
tistical test based purely on a comparison of the disper- 
sion does not have the required power and is not a good 

We thank the anonymous referee to point this out. 



Table 3. The results of the one-line regressions in the form of 
m = alog(P) + b, where rr is the dispersion of the regression. 



Bandpass 


a 


b 




tj 




Sample A, 


= 1330 






V... 


-2.669 ±0.031 


17.074 ± 0.020 


0, 


,285 


R... 


-2.826 ±0.027 


16.860 ±0.018 


0. 


,244 


J... 


-3.080 ±0.022 


16.277 ±0.014 


0. 


,130 


H ... 


-3.175 ±0.020 


16.046 ± 0.013 


0. 


,114 


K ... 


-3.156 ±0.020 


15.973 ± 0.014 


0. 


,129 




Sample B, N 


= 1322 






V ... 


-2.672 ±0.031 


17.066 ± 0.020 


0, 


,257 


R... 


-2.828 ±0.027 


16.853 ± 0.018 


0, 


,220 


J ... 


-3.080 ±0.022 


16.275 ± 0.014 


0, 


124 




Q 1 7/1 _i_ n non 
— o.i (4 it U.UZU 


iO.U44 it U.Uio 


n 
U. 


111 


K ... 


-3.156 ±0.020 


15.972 ± 0.014 


0. 


128 




Sample C, N 


= 1216 






V ... 


-2.699 ±0.035 


17.087 ± 0.023 


0, 


,259 


R... 


-2.852 ±0.030 


16.871 ± 0.020 


0, 


,222 


J ... 


-3.088 ± 0.024 


16.281 ± 0.016 


0. 


,124 


H ... 


-3.169 ±0.022 


16.040 ±0.015 


0, 


,110 


K ... 


-3.159 ±0.021 


15.974 ±0.015 


0, 


124 



statistical test to apply to our data. This is due to the 
following reasons: (a) the dispersion is dominated by the 
intrinsic dispersion {cis) because of the existence of the 
instability strip; (b) the majority of the Cepheids are in 
the short period range which dominates the dispersion; 
and (c) more importantly, the non-linear signature of the 
mean light LMC PL relation is not obvious as shown in, 
e.g., FigureEl or with a large quadratic term (7) in equa- 
tion (3). Therefore, it is necessary to apply the chi-square 
test and the P-test to detect such non-linear signature 
from our data. 

The extinction corrected VRJHK-h&nd PL relations 
are presented in the left panels of Figure |1] (from Sample 
C for illustrative purposes, together with the fitted two- 
lines regressions. The PL relations from the other two 
samples, as well as other forms of regressions, are similar 
to Figure 0J. From Figure 0] the V- and 7?-band PL re- 
lations imply that there is a break or sharp non-linearity 
at/around 10 days, marginally in the J-band PL relation, 
but not obvious in the H- and if-band PL relations. The 
right panels of Figure 2] compare the residuals of the fit 
as a function of period using one-line and two-line regres- 
sions (again the residual plots from the two other forms 
of regressions are very similar to these plots). From the 
residual plots, both of the V- and P-band residuals show a 
trend at log(P) > 1.0 when using the one-line regression. 
These trends are removed if two-line regressions are used 
to fit the data. These findings also support a break in the 
PL relation at a period of 10 days. Similarly, the J-band 
residuals show a marginal trend, which is not obvious in 
the H- and K-ha.nA. 

Both of the chi-square test and the P-test were then 
applied to the samples. The results from the chi-square 
test and the P-test are summarized in Table |7| fc |H1 re- 
spectively. These results can be compared with Table Q 
to decide if the null hypothesis can be rejected. If the 
and/or the P values given in Table |7|&|H| are smaller than 
the values corresponding to p = 0.05, then the underly- 
ing PL relation is linear. In contrast, if the and/or the 
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Figure 3. LMC extinction corrected PC (left panel) and PL (right panel) relations at phase of 0.82 (where phase zero = maximum 
light), obtained from using the OGLE photometric data. A discontinuity at/near lOdays is clearly visible. 



F values in these tables are larger than the values cor- 
responding to p = 0.01, then the underlying PL relation 
is non-linear. For the the and/or the F values that 
fall in-between the values corresponding to p = 0.05 and 
p — 0.01, we classify the PL relation to be marginally 
linear. In short, from TableEI&|H]we found that: 

(i) For Ha = two-line and piecewise continuous regres- 
sions, both of the chi-square test and the F-test show 
that the VRJ & i?-band data (in all three samples) are 
more consistent with the two-line regression, and the null 
hypothesis with one-line regression can be rejected with 
very high confidence level (> 99.9%). In contrast, the K- 
band data (in all three samples) is consistent with the 
one- line regression from the chi-square test. The F-test 
results for the i("-band data also show that the .ftT-band 
PL relation is marginally linear^. 

(ii) For Ha ~ quadratic regression, the results are 
quite different to the two other types of regression. The 
linear/non- linear behavior is quite different in VRJ- and 
_ff-band data across the three samples. From the chi- 
square test, the V7?J-band PL relations are linear in 
Sample A, then become marginally linear in Sample B 
and non- linear in Sample C. The //-band PL relation is 
marginally linear in Sample A but non-linear in both of 
the Sample B & C from the chi-square test. In terms 
of the _F-test, Sample A shows that the V- & 7J-band 
PL relations are linear, marginally linear for the J-band 
PL relation and non-linear for the //-band PL relation. 



^ Note that for the K-band results, the _F-values for 
Ha =piecewise continuous regression are larger than the up- 
values from Ha =two-line regression. Hence the .fC-band PL 
relation will be more consistent with the piecewise continuous 
regression if p = 0.01 is chosen and vice versa. 



In Sample B, the F-band PL relation is still linear, but 
the 7?-band PL relation becomes marginally linear, and 
both of the J- and _ff-band PL relations are non-linear. 
The VRJ- and _ff-band PL relations become non-linear in 
Sample C. In contrast, the K-hand PL relations are linear 
in all three samples from both of the chi-square test and 
the _F-test. 

In terms of the sample selection, the results from 
Sample A are not a good choice as the final adopted re- 
sults. This is because Sample A suffers from obvious out- 
liers as shown in Figure Further, these outliers have 
made the quadratic fit for the V7?J-band PL relations to 
be more linear (as the quadratic term in equation [3], 7, 
is consistent with zero, see Table |SJ, in contrast to the 
results from the two-line or piecewise continuous regres- 
sions. Hence the results from Sample A is not considered 
further. Both of the Sample B and Sample C are freed 
from the obvious outliers, with the difference that Sam- 
ple B includes Cepheids with log(P) < 0.4 but Sample C 
excludes these very short period Cepheids. The statistical 
tests with two-line and piecewise contiimous regressions 
show consistent results for both samples, but the results 
with quadratic regression are slightly different as men- 
tioned previously. This implies that neglecting Cepheids 
with log(P) < 0.4 does not influence our results. We favor 
the results from Sample C because there are some phys- 
ical reasons to exclude the Cepheids with log(P) < 0.4, 
as discussed in Section 2 (the contamination from over- 
tone Cepheids and the possible changes of slope due to 
evolutionary effect). 

Among the three full non-linear models, the true 
form of the non-linearity is difficult to distinguished sta- 
tistically from the data. This has to be done together with 
the theoretical calculations, which is beyond the scope of 
this paper. However, we favor both two-line and piece- 
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Figure 4. Left (a): The extinction corrected VRJHK-hand PL relations for Sample C. The dashed and solid lines arc the fitted 
one-line and two-line regressions, respectively. The error bars are the quadiatm e sum of the photometric errors in mean magnitudes 
{(Jx), the errors from the random phase corrections that are only applicable to JHK-hands {<Jrp) and the errors from the extinction 
i'^Ax)- Right (b): The residuals from using one-line (top panel) and two-line (bottom panel) regression fits. The dashed line 
indicates the zero residual. The dotted lines are the Icr dispersions. 
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es, t T,5- 0.5; t fs- 



Figure 4. continued. 



Table 4. The results of the two-line regressions in the form of m = a(s,L) log{-P) + l>(s,L)t where cr(s,L) is the dispersion of the 
regression. The subscripts s,L are refereed to the short (P < 10 days) and long period Cepheids, respectively. 



Bandpass as 6s "'S o,l bL ctl 











Sample A, Ns 




1261, 


Nl = 69 














V ... 


-2 


771 ± 


041 


17.130 ± 0.025 





286 


-2.039 ±0 


222 


16 


409 ± 


258 





238 


R... 


-2 


919 ± 


036 


16.911 ±0.022 





245 


-2.263 ±0 


193 


16 


267 ±0 


224 





200 


J... 


-3 


143 ± 


029 


16.312 ±0.018 





129 


-2.884 ±0 


151 


16 


093 ± 


175 





132 


H ... 


-3 


234 ±0 


027 


16.078 ±0.017 





113 


-3.003 ± 


134 


15 


883 ±0 


155 





117 


K ... 


-3 


193 ±0 


028 


15.994 ± 0.017 





129 


-3.080 ± 


125 


15 


904 ±0 


145 





113 










Sample B, Ng 




1253, 


Nl = 69 














V ... 


-2 


787 ±0 


041 


17.129 ±0.025 





256 


-2.039 ± 


222 


16 


409 ±0 


258 





238 


R... 


-2 


933 ± 


036 


16.910 ±0.022 





220 


-2.263 ±0 


193 


16 


267 ±0 


224 





200 


J ... 


-3 


148 ± 


029 


16.312 ±0.018 





123 


-2.884 ±0 


151 


16 


093 ± 


175 





132 


H... 


-3 


235 ± 


027 


16.078 ±0.017 





110 


-3.003 ± 


134 


15 


883 ± 


155 





117 


K ... 


-3 


195 ± 


028 


15.994 ±0.017 





129 


-3.080 ±0 


125 


15 


904 ± 


145 





113 










Sample C, Ns 




1147, 


Nl = 69 














V ... 


-2 


867 ±0 


048 


17.184 ±0.030 





257 


-2.039 ± 


222 


16 


409 ±0 


258 





238 


R... 


-3 


005 ±0 


042 


16.959 ± 0.026 





221 


-2.263 ± 


193 


16 


267 ±0 


224 





200 


J ... 


-3 


179 ±0 


033 


16.333 ± 0.021 





122 


-2.884 ± 


151 


16 


093 ±0 


175 





132 


H... 


-3 


239 ±0 


030 


16.081 ±0.019 





109 


-3.003 ± 


134 


15 


883 ±0 


155 





117 


K... 


-3 


207 ±0 


030 


16.002 ± 0.019 





125 


-3.080 ± 


125 


15 


904 ±0 


145 





113 
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Table 5. The results of the piecewise continuous regressions in the form of m = c + d X min[log(P), 1.0)] + e X max[0, log{P) — 1.0] , 
where a is the dispersion of the regression. 



Bandpass 




c 




d 




e 






f7 








Sample A, = 1330 












V ... 


17, 


.128 ±0, 


.024 


-2.767 ±0.038 


-2, 


.000 ± 0. 


,155 


0, 


.284 


R... 


16, 


.908 ±0, 


.021 


-2.915 ±0.033 


-2, 


.219 ±0. 


,135 


0, 


.243 


J ... 


16, 


.305 ± 0, 


.017 


-3.130 ±0.027 


-2, 


.751 ±0. 


,106 


0, 


.129 


H ... 


16, 


.072 ± 0, 


.016 


-3.221 ±0.025 


-2, 


.889 ±0. 


,094 


0, 


.113 


K ... 


15, 


.989 ±0, 


.016 


-3.184 ±0.025 


-3, 


.004 ±0. 


,089 


0, 


.129 








Sample B, A = 1322 












V ... 


17, 


.125 ±0, 


.024 


-2.778 ± 0.038 


-1, 


.944 ± 0. 


155 


0, 


.255 


R... 


16, 


.906 ±0, 


.021 


-2.925 ±0.033 


-2, 


.170 ±0. 


135 


0, 


.219 


J ... 


16, 


.304 ± 0, 


.017 


-3.133 ±0.027 


-2, 


.735 ±0. 


106 


0, 


.124 


11 . . . 




(^71 -I- n 


.016 


-3.222 ±0.025 


O 

— z, 


.ooU it U. 




U. 


111 


K ... 


15, 


.989 ±0, 


.016 


-3.185 ±0.025 


-2, 


.999 ±0. 


,089 


0, 


.128 








Sample C, N = 1216 












V ... 


17, 


.172 ±0, 


.028 


-2.846 ± 0.044 


-1, 


.863 ±0. 


157 


0, 


.256 


R... 


16, 


.948 ±0, 


.024 


-2.985 ±0.038 


-2, 


.098 ±0. 


137 


0, 


.219 


J ... 


16, 


.321 ±0, 


.019 


-3.157 ±0.030 


-2, 


.708 ± 0. 


107 


0, 


.123 


H ... 


16, 


.072 ±0, 


.018 


-3.223 ±0.028 


-2, 


.879 ± 0. 


,095 


0, 


.109 


K ... 


15, 


.995 ±0, 


.018 


-3.194 ±0.028 


-2, 


.988 ± 0. 


,090 


0, 


.124 



Table 6. The results of the quadratic regressions in the form of m = a + /31og{P) + 7[log{P)]^, where cr is the dispersion of the 
regression. 



Bandpass 




a 






/3 




7 








<j 










Sample 


A, N = 1330 














V ... 


17, 


.126 ± 


0. 


,050 - 


-2.829 ±0.144 


0, 


.110 ± 


0. 


,097 


0, 


,285 


R... 


16, 


.914 ± 


0. 


,044 - 


-2.993 ±0.125 


0, 


.115 ± 


0. 


,084 


0, 


,244 


J ... 


16, 


.336 ± 


0. 


,036 - 


-3.259 ±0.102 


0, 


.123 ± 


0. 


,068 


0, 


130 


H ... 


16, 


.125 ± 


0. 


,035 - 


-3.412 ±0.098 


0, 


.159 ± 


0. 


,065 


0. 


,114 


K ... 


15, 


.996 ± 


0. 


,037 - 


-3.222 ±0.102 


0, 


.043 ± 


0. 


,066 


0. 


,129 










Sample B, N = 1322 














V ... 


17, 


.160 ± 


0. 


,050 - 


-2.961 ±0.144 


0, 


.200 ± 


0. 


,097 


0. 


,257 


R... 


16, 


.944 ± 


0. 


,044 - 


-3.108 ±0.125 


0, 


.193 ± 


0. 


,084 


0. 


,220 


J ... 


16, 


.346 ± 


0. 


,036 - 


-3.296 ±0.103 


0, 


.148 ± 


0. 


,068 


0. 


124 


H ... 


16, 


.131 ± 


0. 


,035 - 


-3.434 ± 0.098 


0, 


.174 ± 


0. 


,065 


0. 


111 


K ... 


16, 


.000 ± 


0, 


,037 - 


-3.236 ±0.102 


0, 


.052 ± 


0. 


,066 


0. 


128 










Sample C, N = 1216 














V ... 


17, 


.345 ± 


0, 


.072 - 


-3.435 ±0.197 


0, 


.477 ± 


0. 


126 


0. 


,258 


R... 


17, 


.114 ± 


0, 


.063 - 


-3.545 ±0.171 


0, 


.449 ± 


0. 


,109 


0. 


,221 


J ... 


16, 


.427 ± 


0, 


.050 - 


-3.502 ±0.136 


0, 


.267 ± 


0. 


,086 


0. 


123 


H ... 


16, 


.159 ± 


0, 


.046 - 


-3.505 ±0.124 


0, 


.215 ± 


0. 


,078 


0. 


109 


K ... 


16, 


.031 ± 


0, 


.046 - 


-3.317 ±0.123 


0, 


.100 ± 


0. 


,077 


0. 


124 



PL relation. For the (4-parameter) two-line regression 
model vs. the piecewise continuous regression model, it 
is true that either model may be appropriate. Again this 
has to wait for the future theoretical calculations to an- 
swer this problem. 

In short, from the above discussion, the main results 
we found from this study are: 

Based on the data used in this study, the LMC V- 
& R-band relattons are non-linear around a period of 10 
days. Assuming that veracity of the random phase correc- 
tions used for the JHK data, we also find the J- & H-band 
PL relations are also non-linear across this same period 
but the K-band PL relation is (marginally) linear. 



wise continuous regression model over the polynomial fit 
because of the following reasons: (a) the residual plots 
in Figure 2] suggest that the two-line/piecewise continu- 
ous regression is more applicable; (b) the (optical) PC 
relations are more consistent with two-line or piecewise 
continuous regression (see left panels of Figure below 
and the PC relation presented in ^amman n ct al. 20q3; 
iKanbur fc Ngeo w 2004: Sandage et all2004l) : (c) a simple 
period-mean density relation predicts a PL relation lin- 
ear in log(P) but not in [log {P)f; and (d) FigureElshows 
that the two-line or the piecewise continuous regressions 
are more appropriate to describe the data. Nevertheless 
it is clear this question is rather moot since a two-line or a 
piecewise continuous model with two separate slopes for 
short and long period Cepheids approximates a quadratic 
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5 CONCLUSION AND DISCUSSION 

By applying the statistical tests to the VRJHK-ha.nA 
PL relations obtained from the MACHO and 2MASS 
data, we find that the VRJH-h&nA PL relations show 
strong evidence that they are non-linear while the K- 
band PL relation is (marginally) consistent with a lin- 
ear PL relation. The non-linearity of the extinction cor- 
rected F-band PL relations as seen fr om two indepen- 
dent datasets, one from the OGLE data ("Xammann et al.1 
I2OO2I: iKanbur fc Ngeow 2004: Sandage et al. „2004i) and 
another from this study, strongly indicates that this non- 
linearity is real and not due to the artifacts of, for ex- 
ample, the extinctio n error and /o r the p hotometric re- 
ductions. Note that IPersson et alJ (|2004h suggested the 
NIR PL relations (based on 88 LMC Cepheids) are lin- 
ear. Their results may well be infiuenced by the small 
number of short period Cepheids (18) in their sample. 
Recall that Table |5| indicates how small numbers of short 
and/or long period Cepheids can lead to an F-test indi- 
cating consistency with one-line regression. We can apply 
the -F-test to th e 88 Cepheids ( after the extinction cor- 
rection) given in lPersson et alj ll2004ll . including the 18 
short period Cepheids. The F-test results are: Fj = 2.825 
(p = 0.065), Fh = 2.356 (p = 0.101) and Fk = 2.215 
(p = 0.116). These results indicate that the correspond- 
ing PL relations ar e linear, as expected (from Table |2J, 
and as reported in lPersson et all (1200411 . though the re- 
sult for the J-band is marginal. The reason for these non- 
significant results is because of the small number of short 
period Cepheids (however, see Section 5.2), leading to a 
short period slope that has a large error. However, we also 
see that our F-test results for this dataset has decreasing 
significance as the wavelength increases. This is exactly 
what we see in our results for the MACHO/2MASS JHK 
data. 

The reason that the PL relation is non-linear is be- 
cause t he PC relation for the LMC Cepheids is also non - 
linear llTammann et alj 120021: iKanbur fc Ngeowl l20o3) . 
This can be clearly seen from upper panels of Figure 
El for extinction corrected {V - R) k {V - K) PC re- 
lations. Since both of the PL and PC relations obey 
the more general PLC relations, the morphology of the 
PC relation will affect the morphology of the PL rela- 
tion (Madorc & Frccdman 1991). Recall that the lumi- 
nosity variation for the Cepheid variables in visual band s 
is dominated by the temperature variation JCoxlll980h . 
However, as the waveband moves toward the infrared, 
the radius variation begin to dominate the variation in 
the infrared PL relations (e.g., Cox 1980; Madorc 1985). 
Since there is no obvious reason that the period-radius re- 
lation for LMC Cepheids should be non-l inear (see, e.g., 
iBono et alll99^ : lGieren et al.ll998lll999^ . one would ex- 
pect that the PL relation is linear toward the infrared. 
From this study, we see that the PL relation is non-linear 
at VRJ- & _ff-band, and marginally linear at K-hanA. This 
suggests that the radius variation may begins to dominate 
at T^^band and longer wavelengths. This evidence can also 
be seen from the lower-right panel of Figure|Kl where the 
extinction corrected (J — K) PC relation becomes linear. 
These findings also imply that the non-linearity of the PL 
relations is due to the temperature (or colour) variations 



Table 7. The results of the chi-square test, with Hq =one-line 
regression and log (Po) = 1.0. 



Ha = 


two- lines 


piecewise 


quadratic 


Bandpass 






x' 




Sample A, 


N = 1330 




V ... 


19.47 


19.41 


1.282 


R... 


21.26 


21.16 


1.869 


J... 


11.69 


10.14 


3.222 


H ... 


11.14 


9.687 


6.063 


K ... 


3.870 


3.127 


0.426 




Sample B, 


N = 1322 




V ... 


23.35 


22.99 


4.235 


R... 


25.34 


24.89 


5.235 


J... 


13.06 


11.14 


4.652 


£1 . . . 


1 1 ni 

li.yi 


1 n OQ 


t .z to 


K ... 


4.166 


3.321 


0.631 




Sample C, 


N = 1216 




V ... 


30.85 


29.60 


14.38 


R... 


33.40 


31.94 


16.86 


J... 


16.02 


13.26 


9.553 


H ... 


11.52 


9.778 


7.527 


K ... 


4.981 


3.877 


1.698 



as a function of period. The detailed discussion and in- 
vestigation of the physical reason behind the non-linear 
PL and PC relation is beyond the scope of this paper, 
and it will be addressed in a future paper. 

Some attempts at using the long period LMC PL 
relation (from two-l ine regression) in dist a nce scale stud- 
ies ca n be found in I Ka nbur et alj ^|2003 ^. iLeonard et alj 
J2003l). iThim et air ( 2003). Thim et al.l ^|2004^ and 
I^essetan ~ il2005ri without any statistical justification. 
We have shown in Section 3 & 4 that there is strong sta- 
tistical evidence that the (optical) LMC PL relation is 
non- linear, hence the use of long period PL relation is jus- 
tified. iNgcow & Kanbur (2005) has discussed the effect of 
using the broken LMC PL relations (two-line regression) 
in distance scale applications: they found that this effect 
is minimal with a systematic error of 0.03 mag. or less for 
the derived distance modulus. Therefore the finding of a 
non-linear LMC PL relation is more important in terms 
of the studies of stellar pulsation and evolution to under- 
stand the physical reasons that cause the non-linearity 
than in distance scale applications. However, in the era 
of precision cosmology, it is important to use the latest 
understanding of Cepheid PL relations. 

We now discuss some relevant issues that may affect 
our results in the following subsections. 

5.1 Issue of outliers 

Could outliers be one cause for our results? - KN tested 
this extensively for both PL and PC relations and con- 
cluded that outliers cannot be responsible for the break. 
Figure 121 using OGLE data and depicting both PC and 
PL relations at pulsation phase close to 0.82, is remark- 
ably clear of outliers: the break or non-linearity at a pe- 
riod of 10 days is compelling. The fact that our analysis 
of MACH0/2MASS data produce results consistent with 
our previous analysis of the OGLE data set suggests that 
outliers cannot be responsible for the results presented 



© 0000 RAS, MNRAS 000, 000-000 



Non-Linear LMC PL Relations 13 



o.a 




0.5 t 1,5 0.5 1 IS 

IobW log(P) 



Figure 5. The selected PC relations after extinction correction, constructed with the Sample C data as mentioned in Section 2. 



Table 8. The results of the _F-tost, with Hq =one-line regres- 
sion and log{Po) = 1-0. 



Ha = 


two-lines 


piecewisc 


quadratic 


Bandpass 


F 


F 


F 




Sample A, 


N = 1330 




V... 


6.367 


12.71 


0.832 


R... 


7.178 


14.30 


1.251 


J... 


8.988 


15.58 


4.908 


H ... 


9.539 


16.57 


10.32 


K ... 


2.923 


4.723 


0.642 




Sample B, 


N = 1322 




V ... 


9.428 


18.58 


3.383 


R... 


10.53 


20.70 


4.301 


J ... 


10.91 


18.57 


7.692 


H ... 


10.68 


18.33 


12.98 


K ... 


3.182 


5.072 


0.960 




Sample C, 


N = 1216 




V ... 


12.32 


23.64 


11.37 


R... 


13.76 


26.31 


13.75 


J ... 


13.37 


22.07 


15.83 


H ... 


10.30 


17.46 


13.40 


K ... 


3.788 


5.891 


2.573 



in this paper. Nevertheless, we adopt a robust regression 
method which limits the influence of outlie rs by using 
Tukey 's Bi- weight function as, e.g., given in lPress et al.l 
(1992") to the F-band data with two-line vs. one-line re- 
gression. The general idea is to suppress the influence of 
points with large deviation from the expected model. The 
F-test results are: Fy = 5.869 (p = 0.003) for Sample A, 
Fv = 5.924 (p = 0.003) for Sample B and Fv = 9.196 
(p = 0.000) for Sample C, respectively. Minimizing the 
sum of the absolute deviations rather than the sum of 
the squares also yields broadly si milar results. Another 
approach is the RANSAC algorithm (iFisher fc BoUej[l98ll : 



IStorkev et al 11200411 used to detect linear structure in the 
presence of many outliers. This algorithm tries to find out 
how much supp ort a given model ha s among the data. 
Equation (1) of IStorkev et al.l ((ioO^) shows how many 
times we have to repeat the algorithm for a given prob- 
ability of failure. Given that we have over ~ 1300 data 
points this algorithm only makes sense if we have over 
~ 200 outliers and we do not believe this t o be the case. 
We als o tried the sigma-clipping algorithm iUdalski et alJ 
Il999al) to remove more outliers, and again we found that 
our results remain unchanged. 



All the above methods are parametric and make spe- 
cific assumptions about the form of the regression surface. 
The LOESS (Local Linear Smoothing) procedure in SAS 
implements a non-parametric metho d for implement ing 
a non-parametric regression surface (IClevelandlll979l) . If 
we have y{i) — g[x{i)] + e{i), where x,y are the predic- 
tors and independent variables respectively, LOESS works 
by approximating the value, g, of the regression function 
locally, say at xq, by the value of a function in some spec- 
ified parametric class. This is done by fitting a regression 
surface within a chosen neighborhood of xq. The fraction 
of the data points contained in this neighborhood is the 
smoothing parameter. This procedure is suitable when 
there are outliers and robust techniques are required. Fig- 
ure|S|portrays our results using this method when applied 
to the extinction corrected MACHO l/-band data in Sam- 
ple C. The solid line in this figure is the non-parametric 
fit and the points are the original data. The non-linearity 
of the fit between 0.8 < log(P) < 1.1 is self-evident, and 
these results are insensitive to the smoothing parameter 
used. What we take from this is that our results are, to 
a large extent, independent of fitting method and robust 
to outliers. 
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1 

ioqP) 



Figure 6. A robust non-parametric fit (soHd fine) between 
log(P) and the extinction corrected V-band MACHO data. 



5.2 Issue of long period Cepheids 

Could the small number/incompleteness at the long pe- 
riod end be responsible to our results? - If this is the 
case, then these effects should act in the same way at 
all phases. Again Figure |21 displays a preliminary re- 
sult: the reddening corrected PC and PL relations for 
OGLE data at the pulsation phase of 0.82 in the left 
and right panels, respectively. If the true underlying PC 
relation at this phase is indeed linear, then the com- 
bination of selection effects/incompleteness/period cuts 
acts so as to make the longer period Cepheids appear 
cooler than they really are. From observa tions, longer 
period Cepheids obey a flat PC relation JCodj 
I Simoii et al.lll993l:lKanbur fc Ngeo^l2004l: Kanbur et al.l 
120041: iKanbur fc Ngeow ''2005\ Fi gure 2 of KN provides 
evidence for this for LMC Cepheids with the OGLE data. 
We have also evaluated the {V — R) colour at maxi- 
mum light for the MACHO LMC Cepheid sample and 
also find a flat PC relation for these stars at maximum 
light (Kanbur et al 2005 - in preparation). However, if 
a combination of small number/incompleteness at the 
long period end/selection effects is responsible for our re- 
sults, then the true distribution at maximum light must 
be such that longer period Cepheids get hotter or bluer 
as the period increases^. There is very credible physics 
that predicts t he true PC relation at maximum ligh t 
should be flat JSimon et al.lll993l: iKanbur et ai]l2004h . 
Thus it is our contention that a combination of selec- 
tion/incompleteness/period cuts cannot be responsible 
for our results. 

The small number of lo ng p eriod Cepheids is ex- 
pected as lAlcock et"ai] (Il999l) and lUdalski et alj (Il999bl) 
showed that the period distribution for the LMC 
Cepheids peaked at log(P) ~ 0.5 with a long tail ex- 



^ Note that Cepheids can generally get cooler as periods in- 
crease, due to the morphology of the instability strip. 



tended out to the long period end. This is because in 
general Cepheids with longer period have higher mass. 
It is well known that due to the initial mass function 
{dN/dM oc M~" , where M is the mass of the stars and 
a > 0), the number of high mass stars is expected to 
be le ss than the low mass stars (see, e.g., lAlcock et alj 
Il999f) . In addition, the crossing time (across the instabil- 
ity strip) for the high mass stars is shorter than their low 
mass counterparts ( Bono et_aL„2000a) . These two effects 
reduce the number of long period Cepheids in compar- 
ison to the number of short period Cepheids. To show 
our results do not suffer from the missing long period 
Cepheids with log(P) > 1.5, we add extinction corrected 
F-band data f or the 18 longes t period (i.e, log[P] > 1.5) 
Cepheids from lSandaee et al.l (|2C)04) to our sample. We 
then apply the P-test to the combined sample and test 
whether the data are more consistent with a one-line 
or two-line regression. The resulting P-value is 8.734, 
which implies that the LMC T/-band PL relation is still 
non-linear with the additional data. We also tried to in- 
clude aU 97 Cepheids (with 0.4 < log[P] < 2.2) from 
Sandagc et al. ( 2 004t) to our sample, and the P-test re- 
sult still supports the non-linear PL relation. 

A more rigorous test, as suggested by an anonymous 
referee, is to increase the dataset by adding long period 
Cepheids from a homogeneous dataset that does not be- 
long to the MACHO samp le. Such data exist in the lit- 
erature, for example from ISebo et al.l (|20o3) which we 
will use in our test. We first match the Cep heid coordi- 
nates and periods in our MACHO data and ISebo et alj 
||20o3) sample to identify the common Cepheids in both 
samples. We find 203 such stars. For these common 
Cepheids, we calculate the average difference. A, be- 
tween the V and R mean magnitudes in these two sam- 
ples, in the sense that Av = Vmacho — Vsebo and 
Afl — Rmacho — Rsebo- After removing some obvi- 
ous outliers (A > 1), we obtain: Av ~ 0.041 (with std 
= 0.121) and A^ = 0.039 (with std = 0.120). This step is 
to ensure the lSebo et alJ (j20o3) photometric system is the 
same as MACHO photometric system by adding A to the 
ISebo et alj i2002l) mean magnitudes. In addition, we also 
found that there is no period dependency in this trans- 
formation: Av = 0.037(±0.034) log(P) + 0.013(±0.027), 
Ar = -0.028(±0.034)log(P) 0.060(±0.027). If there 
are any dependencies with period in such a transforma- 
tion, it could be that this has influenced our results. We 
see clearly that this is not the case. Similarly, the av- 
erage differen c e in J HK mean magnitudes between the 
IPersson et aU l^2004^ sample and the MACH0/2MASS 
sample is negligible, and more importantly, independent 
of periods (see Section 2 for details). 

These photometric transformations (albei t very close 
to zer o) can now be used to transform the ISebo et alJ 
(|20o3) data to the same photometric system as the MA- 
CHO V- and P-band data used in this study, and to trans- 
form the MACHO /2MASS JHK data to the same system 
as the lPersson et af] i2004ll data. This procedure provides 
a consistent way of increasing our long period sample 
by identifying those stars not in the MA CHO /2M ASS 
sample but w h ich ar e in either the ISebo~et al. (2002) or 
IPersson et alJ (120041) samples. We now describe our re- 
sults when this is carried out. 
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The long pe riod (log[P] > 1.0) Cepheids in the 
ISebo et all sample that are not in the MACHO 

sample are readily identified. We found 49 of them^ 
and they are listed in Table |^ In this tab le, we give 
the or iginal V and R mean magnitudes from ISebo et alJ 
(120021) . as well as the number of observed data points 
{mv.R) for each Cep heids in either bands. The periods are 
adopted form either 'Persson et'al] (|20^ or the OGLE 
database ( Udalski et al. 1999b|). The table also lists the 
JHK mean magnitudes and the E( B — V) values for these 
Cep heids that are compil ed from iPersson et al.l (|200^ 
and lUdalski et allll999bl) , or we set E{B - V) = 0.10 
if no entries a r e giv en in IPersson et alJ (|2p04!) . Since 
the lSebo et al.l ll2002h database lacks Cepheids between 
log(P) ~ 1.55 and log(P) ~ 1.95, we include 8 Cepheids 
in this period range (with th e available JHK mean mag- 
nitudes) from IPersson et al.l ( 2004) in Table |5] for com- 
pleteness. We did not attempt to add the V and R mean 
m agnitude for these 8 Cepheids from the literature, as 
in lSandaee et all ll2004ft . in order to keep our dataset as 
homogeneous and consistent as possible. 

The data in Table H are then added to our MA- 
CHO/2MASS sample, after the appropriate photometric 
transformation. The F-tests are applied to this extended 
sample for Ha = two-lines regression (with a break at 10 
days) using the following selection criteria: 

(i) Include all available long period Cepheids in Table 
1^1 appropriate to the bandpass, to our MACH0/2MASS 
sample. 

(ii) Eliminate Cepheids without JHK mean magni- 
tudes since not all of the Cepheids in Table |5| have the 
JHK means. 

(iii) Eliminate Cepheids with mv.R 55 10 as the mean 
magnitudes derived from these sparsely sampled light 
curves may not be accurate. 

(iv) Combine step (ii) & (iii) above (item 5 in Table 
I10|l in order to have consistent number of Cepheids in all 
five bands. 

The results from these different selection crite r ia are pre- 
sented in Table [TOl Note that IPersson et'ai] (l2004l) did 
not use Cepheids with log(P) > 2.0. These Cepheids are 
eliminated in step (iv) above (item 5 in Table [111 . The 
linearity of the PL relation in each band again can be 
tested by comparing the P- values in this table with Table 
Q As can be seen from the table, including the additional 
long period Cepheids does not alter the main conclusion 
we have: the VRJH-hanA PL relation is non-linear and 
the K-hanA PL relation is (margina lly) linear. Since t here 
are no measurement errors given in lSebo et al.l ((20o3) for 
the V and R mean magnitudes, the results in Table 1101 
are derived with unweighted regression (i.e. lii = 1) in all 
bands. The P- values from using the weighted regressions 
to all bands, with assumed some values for the errors in 
the V- and -R-band means (and the E\B — V], the errors 
for Jffii' means are given in IPersson et alJl2004l) . are very 
similar to Table [Till One exception is the _F- value for the 

^ Some OGLE Cepheids identified by ISebo et al.l <2002h are 
not fundamental mode Cepheids and we exclude them in the 
table. 



if-band when using all available long period Cepheids, 
the P-value goes down from 2.942 to 1.532, which is fully 
consistent with a one-line regression. In this table, except 
for the _ftr-band, the lowest significant P-value is 7.872. 
With the degrees of freedom appropriate for the data in 
our study, this has a p-value less than 10"'^ (recall that 
p < 10^^ for F > 7). This means that assuming the 
null hypothesis of a single regression line being sufficient. 
The chance of getting a value of the F statistic as high or 
greater than that actually observed (7.872) is less than 
10"^ 



5.3 Issue of reddening/extinction 

Could reddening and extinction errors produce our re- 
sults? - Figure |5] displays extinction corrected PC rela- 
tions, at mean light, obtained from the MACH0-)-2MASS 
data for a variety of colours: (y — -R)o, {V—J)o, {V — K)o 
& ( J — K) o. This figure and the PC relations presented 
in KN and lSandaee et aH (|20o3) imply that in order to 
make a linear PC relation, extinction errors would have 
to be of the order of ~ 0.05-0.2 mag. for the longer pe- 
riod Cepheids. If this were indeed the case, the same error 
would apply to PC relations at maximum light. Again, 
longer period Cepheids obey a flat PC relation at max- 
imum light (KN, SKM, Code 1947), an observation for 
which there is a sound theoretical argument (see the refer- 
ence given in Section 5.2). If these longer period Cepheids 
suffered an extinction error of ~ 0.05-0.2 mag., then the 
true underlying PC relation at maximum light would be 
such that as the period increases the colour becomes bluer 
(that is the temperature is hotter) at maximum light. It 
is hard to incorporate this in the theoretical framework 
of ISimon et alj I^Q^), since this paper suggested that 
for very long period Cepheids (log[P] > 1.5 roughly), 
the PC relation at maximum light then starts to have 
a slope such that Cepheids become redder as the period 
gets longer. This is in the opposite way to what would 
need to be true were the PC relation to be linear at mean 
light. Further, if reddening/extinction errors are indeed 
making the PC/PL relation appear to be non- linear then 
the extinction/reddening from two independent methods, 
as used by the OGLE team and that used in this study, 
would both need to contain errors of the order of ~ 0.05- 
0.2 mag. for the longer period Cepheids. 

Although longer period Cepheids are generally 
younger and as such may be surrounded by dust shells 
which could increase extinction with period and possibly 
produce a non-linear PC relation as is observed. We argue 
again that our knowledge of the properties of Cepheids 
at maximum light as presented above argues against this. 
Further such an effect would be gradual, however the ob- 
served PC relation is sharply non-linear as is evident in 
the left panel of Figure|21 The use of two extinction maps, 
one for the OGLE data and another one in this study, to 
correct for the extinction/reddening that produce similar 
results also suggest the effect of the extinction is minimal. 

The anonymous referee has also suggested the fol- 
lowing to test to examine if our extinction corrections 
do indeed represent individual corrections: plotting the 
residuals from the V- and P-band PL relations against 
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Table 9. Additional long period Cepheids from lSebo et al.l J2002^ and lPerason et al.l <2004) . 



Name log(P) my mR V R J H K E(B - V) 



Oglel00794 


1 


.0298 


17 


17 


14.78 


14.30 










0.138 


Ogle50505 


1 


.0356 


14 


13 


14.69 


14.23 










0.127 


HV2432 


1 


.0384 


7 


5 


14.24 


13.96 


12, 


.986 


12.653 


12.568 


0.100 


Ogle96614 


1 


.0862 


20 


19 


14.58 


14.10 










0.138 


OgIell7525 


1 


.1016 


18 


16 


14.43 


13.92 










0.128 


OgIel73734 


1 


.1045 


6 


4 


14.72 


14.20 










0.135 


iiVi5i7 


1 


.1122 


33 


32 


14.62 


14.16 


13, 


.084 


12.684 


12.570 


0.070 


HV2260 


1 


.1136 


16 


14 


14.85 


14.35 


13, 


,263 


12.860 


12.736 


0.130 


Ugle27Uo7y 


1 


.1233 


11 


10 


14.38 


13.85 










0.138 


HV2579 


1 


.1281 


10 


7 


14.04 


13.63 


12, 


.688 


12.334 


12.236 


0.100 


HV955 


1 


.1377 


8 


7 


14.02 


13.62 


12, 


.686 


12.342 


12.251 


0.058 


HV2538 


1 


.1420 


6 


3 


14.50 


14.05 


12, 


.864 


12.452 


12.343 


0.100 




1 


.1450 


10 


8 


14.22 


13.81 


12, 


.748 


12.381 


12.295 


0.100 


HV5655 


1 


.1526 


16 


11 


14.52 


14.06 


12, 


.971 


12.553 


12.443 


0.100 


Oglel67787 


1 


.1610 


11 


6 


14.09 


13.62 










0.145 


lrLV12471 


1 


.2001 


9 


4 


14.73 


13.95 


12, 


.903 


12.453 


12.308 


0.058 


Ogle55470 


1 


.2039 


9 


6 


14.65 


14.08 










0.121 


OgIel82466 


1 


.2039 






14.31 


14.08 










0.127 


HV2549 


1 


.2094 


9 


8 


13.67 


13.36 


12, 


.422 


12.078 


11.982 


0.058 


Oglel60625 


1 


.2358 


19 


14 


13.78 


13.34 










0.138 


HV2261 


1 


.2370 


11 


8 


13.26 


12.90 










0.100 


HV2580 


1 


.2285 


17 


12 


14.01 


13.53 


12, 


.489 


12.102 


11.999 


0.090 


HV2836 


1 


.2437 


13 


9 


14.63 


14.07 


12, 


.747 


12.276 


12.127 


0.180 


Ogle250872 


1 


.2709 


10 


11 


13.97 


13.52 










0.152 


Ull 


1 


.3026 


9 


6 


13.66 


13.23 


12, 


,383 


11.972 


11.860 


0.100 


Oglel48920 


1 


.3160 


21 


18 


13.60 


13.08 










0.147 


OgIe88441 


1 


.3276 


4 


4 


13.31 


13.00 










0.142 


Ogle286532^ 


1 


.3347 


12 


12 


14.01 


13.44 










0.138 


Oglel09640 


1 


.3486 


18 


16 


14.00 


13.51 










0.138 


Ul 


1 


.3533 


9 


5 


14.09 


13.62 


12, 


.350 


11.916 


11.790 


0.100 


HV876 


1 


.3561 


10 


7 


13.53 


13.32 


12, 


.144 


11.766 


11.665 


0.100 


HV878 


1 


.3673 


33 


27 


13.49 


13.07 


12, 


,097 


11.735 


11.634 


0.058 


HV938 


1 


.3724 


5 


5 


13.31 


12.91 










0.100 


HV6098 


1 


.3845 


8 


6 


12.95 


12.56 


11, 


.733 


11.405 


11.317 


0.100 


HV902 


1 


.4209 


6 


4 


13.25 


12.70 


11, 


.833 


11.469 


11.378 


0.070 


HV1023 


1 


.4239 


22 


19 


13.77 


13.20 


12, 


.041 


11.618 


11.498 


0.070 


Oglel62232 


1 


.4828 


16 


11 


13.48 


12.88 










0.152 


HV872 


1 


.4750 


11 


9 


13.69 


13.15 


11, 


.976 


11.540 


11.422 


0.100 


HV875 


1 


.4822 


16 


16 


12.96 


12.58 


11, 


.620 


11.285 


11.192 


0.100 


Ogle228645 


1 


.4921 


11 


7 


13.26 


12.83 










0.129 


HV882 


1 


.5027 


18 


15 


13.35 


12.90 


11, 


.714 


11.328 


11.202 


0.070 


HV5761 


1 


.5046 


5 


3 


13.14 


12.38 










0.100 


HV873 


1 


.5359 


16 


14 


13.06 


12.58 


11, 


.490 


11.109 


10.998 


0.130 


HV881 


1 


.5534 


16 


15 


13.09 


12.61 


11, 


,527 


11.142 


11.032 


0.030 


HV2294 


1 


.5626 










11 


.237 


10.868 


10.770 


0.070 


HV879 


1 


.5656 






13.64 


13.01 


11, 


.590 


11.157 


11.031 


0.060 


HV909 


1 


.5749 






13.03 


12.54 


11, 


.339 


10.984 


10.884 


0.058 


T TT 7r\ r" ^ 

HV2257 


1 


.5943 


9 


8 


13.05 


12.53 


11, 


.355 


10.944 


10.833 


0.060 


HV2338 


1, 


.6248 










11, 


.185 


10.798 


10.690 


0.040 


HV877 


1, 


.6548 










11, 


.438 


10.962 


10.835 


0.100 


HV900 


1, 


.6771 










11, 


.156 


10.751 


10.632 


0.058 


HV953 


1, 


.6802 










10, 


.788 


10.423 


10.317 


0.070 


HV2827 


1, 


.8969 










10, 


.428 


9.976 


9.851 


0.080 


HV5497 


1, 


.9965 


9 


6 


11.93 


11.38 


10, 


.027 


9.602 


9.466 


0.095 


HV2883'' 


2, 


.0398 










10, 


.651 


10.237 


10.098 


0.100 


HV2447 


2, 


.0763 










10, 


.106 


9.704 


9.565 


0.100 


HV883 


2, 


.1268 


8 


6 


12.00 


11.35 


10, 


.302 


9.919 


9.752 


0.100 



^This Cepheids has relatively low amplitude iKanbur & : Ngeow 2005). 
°This Cepheids has relatively high amplitude iKanbur fc Ngeow,i.2005ll . 
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Table 10. F-test results with additional long period Cepheids, where Nf"^^ is the total number of long period Cepheids (MACHO 
+ additional Cepheids from Table 1^ . 



Bandpass 


V 




R 




J 




H 




K 




Criteria 


long 


F 


Long 


F 


Long 


F 


Long 


F 


long 


F 


1. use all available long period Cepheids 


118 


7.872 


118 


8.640 


106 


12.33 


106 


10.61 


106 


2.942 


2. exclude Cepheids without JHK means 


98 


10.96 


98 


12.45 














3. Cepheids with mv > 10 


94 


8.748 






81 


13.71 


81 


10.29 


81 


3.849 


4. Cepheids with m/j > 10 






89 


10.32 


79 


12.70 


79 


9.603 


79 


3.681 


5. combine item (2), (3) and (4) 


79 


10.86 


79 


12.38 


79 


12.70 


79 


9.603 


79 


3.681 



each other (see, e.g lSasselov et aDllQQTl : iBeaulieu et alJ 
l200ir) . The idea is that if the residuals lie along the red- 
dening line, then the reddening corrections do not rep- 
resent individual corrections. We carried out such a test 
with Sample C using the PL relations from a one-line 
and two-line regression. The results are presented in Fig- 
ure [3 From the fi gure it can be seen that the dispersion 
is reduced when using the one-line regressions as com- 
pared to the two-line regression. The points lie along the 
reddening line. 

We, however, argue that this test is not suitable for 
testing the extinction corrections for the following rea- 
sons. First of all, besides the statistical fluctuations, the 
residuals from the PL relation are expec ted to originate 
from three sources iSasselov et alJli997^ : (a) depth dis- 
persion/effects (b) extinction, and (c) intrinsic dispersion 
due to the width of the instability strip. All three sources 
will tend to make the residuals be correlated in the V- 
and 7?-bands, as seen in Figure We ignore the dis- 
cussion of the depth effect (represented as dashed lines 
in Figure [7| because we assume the depth effect is neg- 
ligible for the LMC Cepheids. For the source from ex- 
tinction, recall that the the extinction correction for a 
Cepheid is calculated using A\ — R\E{B — V), where 
the E{B — V) value is either the mean value of the LMC 
or an estimated value for individual Cepheids. Hence a 
correlation of the residuals is naturally introduced (repre- 
sented as solid lines in Figure EJ for the reddening vector 
used in this paper (Section 2), because R\/Rv — con- 
stant. Even for a hypothetical group of Cepheids that are 
free from extinction and depth dispersion, the residuals 
of the V- and 7?-band PL relations will still be correlated. 
This is due to the existence of th e instability strip and 
the PLC relation iSandage| Il958l : iMadore fc Freedniaiil 
llQQlTl . Therefore mapping the Cepheids from the insta- 
bility strip (along the constant-period line) to the V- and 
7?-band PL relations will cause the residuals to be un- 
avoidably correlated. Unfortunately, this line of residuals 
is very close to the reddening line in the residual plot s 
(not shown in Figure |71 see also ISasselov et ahl Il997t) . 
This makes it hard to say whether the correlation of the 
residuals seen in Figure |7| is caused by the extinction or 
the width of the instability strip*. 



In this paper we have tried our best to correct for 
the extinction of the LMC Cepheids. Based on the above 
arguments we believe that extinction errors are not the 
physical cause for the observed non-linear LMC PL re- 
lations. Ot her physical reasons, such as the pu lsational 
properties jBono et alJll99gl: ICaputo et alJl2000ri and /or 
the internal structure (Simon et al.lll993l: iKanbur et all 
2004; Kanbur & Ngcow 2005J) of the Cepheid variables 
(we refer to these as "internal reasons" as opposed to 
the "external reasons" like dust extinction) may be re- 
sponsible for the observed non-linear PL relations. We 
have provided compelling evidence that the (extinction 
corrected) Cepheid PL relation in the LMC is non-linear 
around a period of 10 days in the optical V- & 7?-bands. 
Assuming the random phase corrections methods used in 
this study, we also find the PL relation in the J- & H-hand 
to be non-linear around this period but that the K-hand 
relation is marginally linear. An investigation into the 
physics behind this is left for future studies. 
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